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The nucleotide sequence of the 3'-end of the genomic RNA of sialodacryoadenitis virus strain 681 (SDAV-681) was 
determined. A large open reading frame encoding a 454-amino-acid protein was identified as the nucleocapsid protein 
(N) gene, since the predicted protein is similar in size, chemical properties, and amino acid sequence to the N proteins 
of other coronaviruses. The amino acid variance of the N proteins between SDAV and mouse hepatitis virus (MHV) is 
not markedly different from that among MHV strains. A high degree of genetic relatedness between SDAV and MHV 
was revealed in the intergenic and 3'-noncoding sequences as well as in the N gene. © 1993 Academic Press, inc. 


Sialodacryoadenitis virus (SDAV) is a member of the 
family Coronaviridae and is known to be an important 
pathogen which infects laboratory rats at high preva- 
lence, causes clinically significant disease associated 
with rhinitis and sialodacryoadenitis, and interferes 
with research that uses rats. It shares antigenic rela¬ 
tionships with other rodent coronaviruses, such as 
Parker’s rat coronavirus (PRCV) and mouse hepatitis 
virus (MHV) (/). The genomic organization and the 
unique strategy of replication of MHV have been well 
characterized (2). The virion contains a single- 
stranded, positive-sense RNA genome of approxi¬ 
mately 32 kilobases (kb). The genome is polyadenyl- 
ated and organized into seven regions, each contain¬ 
ing one or more open reading frames (ORFs). In 
infected cells, a genomic-sized mRNA and six subge- 
nomic mRNA species are synthesized which form a 
nested-set with a common 3'-terminus but extending 
to different lengths in the 5'-direction. Only the 5'-proxi- 
mal gene of each mRNA is utilized for translation. The 
genes encoding the structural viral proteins have been 
identified; the spike glycoprotein (S) gene is third from 
the 5'-end (gene 3), the membrane glycoprotein (M) 
gene is sixth from the 5'-end (gene 6), and the nucleo¬ 
capsid protein (N) gene is the 3'-terminal gene (gene 7). 
Additionally, the hemagglutinin-esterase glycoprotein 
(HE) gene (gene 2-1) is translated by only some of the 
MHV strains (3). In contrast to MHV, the molecular biol¬ 
ogy of SDAV is not understood. To elucidate the struc¬ 
ture and the molecular biological properties of SDAV, 
we have initiated sequence analysis of the SDAV ge¬ 

1 Sequence data from this article have been deposited with the 
DDBJ, EMBL, and GenBank Libraries under Accession No. D10760. 

2 To whom reprint requests should be addressed. 


nome. We report here the complete nucleotide se¬ 
quence of the N gene of SDAV-681 and its relatedness 
to other coronaviruses. 

SDAV-681 (/) was propagated in a rat mammary tu¬ 
mor cell line (LBC) as described by Hirano et ai. (4). 
Virus was purified by centrifugation on a discontinuous 
sucrose gradient consisting of 60 and 30% (w/w) su¬ 
crose in TNE buffer (50 m/W Tris-HCI, pH 7.5, 100 mM 
NaCI, 1 mM EDTA) at 110,000 g for 2 hr at 4°. The 
virus recovered from the interface between 60 and 
30% sucrose was incubated in TNE buffer containing 
0.5 mg/ml proteinase K and 0.5% SDS for 10 min at 
56°. The genomic RNA was extracted with phenol- 
chloroform-isoamyl alcohol and precipitated with eth¬ 
anol. Double-stranded cDNA was synthesized from 
SDAV genomic RNA primed with oligo(dT), using 
cDNA synthesis system plus (Amersham). After addi¬ 
tion of BamVW adaptor (Pharmacia) to blunt-ended 
cDNA, the cDNA was purified on a cDNA spun column 
(Pharmacia) and ligated into the BamHI-site of plasmid 
vector pUCl9 (Takara). The ligated molecules were 
used to transform Escherichia coli JM109 and cells 
containing recombinant plasmid were selected on LB 
agar plates that contained 50 ^9 ampicillin/ml, 1 mM 
IPTG, and 0.01% X-gal. Recombinant colonies were 
transferred to nitrocellulose filters and probed with 32 P- 
(abeled MHV-N gene cDNA (1362 nucleotides) pre¬ 
pared from MHV-A59 genomic RNA by polymerase 
chain reaction (PCR) described previously (5). Colonies 
yielding strong signals were analyzed for insert size. 
Five clones containing 0.4- to 1.7-kb inserts (# 1 —#5 in 
Fig. 1 A) were sequenced in both directions by the di- 
deoxy method using modified T7 DNA polymerase (Se- 
quenase, U.S. Biochemical). An additional cDNA clone 
(#6 in Fig. 1 A) was obtained using a synthetic oligonu- 
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5•-TTGTTGAGAATCTAA TCTAAAC TTTAAGG ATG TCTTTTGTTCCCGGACAA 5 0 

GAAAACGCCGGTAGCAGAAGCTCCTCTGGAAACCGCGCTGGTAATGGAATCCTCAAGAAG 110 
ACCACTTGGGCTGACCAAACCGAGCGCGGACAAAATAATGGAAATAGAGGCAGAAGGAAT 170 
CAGCCCAAGCAGACTGCAACTACTCAGCCCAATACCGGGAGTGTGGTTCCCCATTACTCT 230 
TGGTTTTCGGGCATTACCCAATTCCAGAAGGGAAAAGAGTTCCAGTTTGCAGGTGGACAA 290 
GGAGTGCCTATTGCCAATGGAATCCCACCTTCTGAGCAAAAGGGATATTGGTATAGACAC 350 
AACCGTCGTTCTTTTAAAACACCTGATGGGCAGCAGAAGCAACTACTCCCCAGATGGTAT 410 
TTTTACTATCTTGGGACGGGCCCCCATGCTGGAGCCAGTTTCGGAGACAGCATTGAGGGA 470 
GTCTTCTGGGTTGCAAATAGTCAGGCGGATACCAACACCTCTGCTGACATTGTTGAAAGG 530 
GACCCAAGTAGCCATGAGGCTATTCCTACTAGGTTTGCGCCCGGTACGGTATTGCCTCAG 590 
GGTTTCTATGTTGAAGGCTCGGGAAGGTCTGCACCTGCTAGTCGATCTGGTTCGCGGTCA 650 
CAATCCCGTGGGCCAAATAATCGCGCTAGAAGCAGTTCCAACCAGCGCCAGCCTGCCTCT 710 
ACTGTAAAACCTGATATGGCCGAAGAAATTGCTGCTCTTGTTTTGGCTAAGCTAGGCAAA 770 

'WW 

GATGCCGGACAGCCTAAGCAAGTAACTAAGCAAAGTGCCAAAGAAGTCAGGCAGAAAATT 830 
TTAAATAAGCCTCGCCAAAAGAGGACTCCAAACAAGCAGTGCCCAGTGCAGCAGTGTTTT 890 
GGAAAGAGAGGCCCCAATCAGAATTTTGGAGGCCCTGAAATGTTAAAACTTGGAACTAGT 950 
GATCCACAGTTCCCCATTCTTGCAGAGTTGGCCCCAACACCTGGTGCCTTCTTCTTTGGA 1010 
TCTAAATTAGAATTGGTCAAAAAGAATTCTGGTGGCGTTGATGAACCCACCAAAGATGTG 1070 
TATGAGCTGCAATATTCAGGTGCAGTCAGATTTGATAGTACTCTACCTGGTTTTGAGACT 1130 
ATCATGAAAGTGTTGAATGAGAATTTGAACGCCTACCAGAATCAAGCTGGTGGTGCAGAT 1190 
GTAGTGAGCCCAAAGCCCCAAAGAAAGAGAGGGACGAAACAAACGGCTCAGAAAGAAGAA 1250 
TTAGATAGTATAAGCGTTGCAAAGCCCAAAAGTGCCGTGCAGCGAAATGTAAGCAGAGAA 1310 
TTAACCCCAGAGGATAGAAGCCTGTTGGCGCAGATCCTAGATGATGGCGTTGTGCCTGAT 1370 
GGGTTAGATGACTCTAATGTGTAAAGAGAATGAATCCTATGTCGGCGCTCGGTGGTAACC 1430 
CCTCGCGAGAAAGTCGGGATAGGACACTCTCTATCAGAATGGATGTCTTGCTGTCATAAC 1490 
AGATAGAGAAGGTTGTGGCAGACCCTGTATCAATTAGTTGAAAGAGATTGCAAAATAGAG 1550 
AATGTGTGAGAGAGTGTAAGGTCCTACGTCTAACCATAAGAACGGCGATAGGCGCCCCCT 1610 
GGGAAGAGCTCACATCAGGGTACTATTCCTGCAATGCCCTAGTAAATGAATGAAGTTGAT 1670 
CATGGCCAATTGGAAGAATCAC-POLY(A)-3' 1692 

Fig. 1. Sequencing of the 3'-end of the SDAV-681 genome. (A) cDNA clones and sequencing strategy. Clone #1 was sequenced completely 
from subcloned fragments generated by restriction endonuclease Oral. Clones #2-#6 were sequenced partly their 5'- or 3'-termini. (B) The 
primary nucleotide sequence of the 3'-end (1692 nucleotides) of the SDAV-681 genome. The initiation and termination codons for the N open 
reading frame are underlined and those for the internal open reading frame are shown by the wavy lines. The intergenic consensus sequence 
(TCTAAAC) and the 3'-conserved sequence (GGAAGAGC) are shown by the double lines. The region indicated by the dotted line was used for 
preparing a complementary oligonucleotide primer to obtain clone #6. 
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Fig. 2. Amino acid sequence comparison of the N proteins of SDAV 681 and five MHV strains. The deduced sequence of the SDAV-681 N 
protein is from this work. The sequences for MHV-2 and -Nu67 were obtained from recent work (5). The MHV-A59 and -S sequences are taken 
from Parker and Masters (6) and the MHV-JHM sequence is taken from Skinner and Siddell (7). Spaces indicate amino acids identical to the 
SDAV-681 sequence. Hyphens indicate gaps introduced to maximize alignment. The spacer regions A and B (amino acid positions 140-162 
and 381-406 in the SDAV-681 sequence, respectively) proposed by Parker and Masters (6) are bracketed. 


cleotide primer complementary to nucleotides 183- 
202 (Fig. 1 B) for cDNA synthesis and then sequenced 
in the same manner. 

The nucleotide sequence of the 3'-end of SDAV-681 
genome covering the complete ORF of 1362 nucleo¬ 
tides is presented in Fig. IB. This ORF extends from 
nucleotides 30 to 1391 and potentially encodes for a 
protein of 454 amino acids. The predicted protein is 
highly homologous to the N proteins of MHV strains 
(MHV-A59, -JHM, -2, -S, and Nu67) aligned in Fig. 2. 
Nucleotide and amino acid sequences of SDAV-681 
and MHV strains are 89.0 to 92.7% and 90.1 to 93.8% 
identical, respectively. Incidentally, the amino acid se¬ 
quence identity among the N proteins of MHV strains is 
91.8 to 97.1%. Coronaviruses can be divided into sev¬ 
eral antigenic groups. SDAV belongs to a group includ¬ 
ing MHV, bovine coronavirus (BCV), human corona- 
virus strain OC43 (HCV-OC43), and hemagglutinating 
encephalomyelitis virus of swine (HEV). The N protein 
of MHV shows an amino acid sequence homology of 
approximately 70% with BCV (8) and HCV-OC43 (9). 
The high degree of sequence identity between SDAV 


and M HV shown in this study confirmed the close rela¬ 
tionship of these two viruses as revealed by serological 
analysis (/, 10). The predicted N protein of SDAV pos¬ 
sesses some features which are characteristic of the N 
proteins of coronaviruses. That is to say, the protein is 
highly basic, suggesting RNA binding property of this 
protein (/ /), and is rich in serine residues, which are 
presumed to be sites of phosphorylation (12). Sixty of 
the amino acid residues (13%) are basic and forty-two 
residues (9%) are acidic, giving the protein a net 
charge of +18 at neutral pH. There are thirty-nine ser¬ 
ine residues (9%) located at the almost same positions 
as MHV. 

SDAV-681 has an internal ORF in the +1 reading 
frame within the N gene (Fig. 1B}. This ORF potentially 
encodes a protein of 207 amino acids, which is basic (a 
net +7 charge at neutral pH) and rich in hydrophobic 
amino acids (50%) especially leucine residues (18%). 
An internal ORF of identical size has been found within 
the N genes of MHV and BCV (6, 8). Recently, it was 
demonstrated for BCV that a 23-kDa protein encoded 
by the internal ORF lying within the N gene was ex- 
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pressed in virus-infected cells [13). Further studies are 
required to reveal the functional significance of this 
protein and whether similar proteins can be detected 
for other coronaviruses. 

Noncoding regions flanking the N gene on SDAV ge¬ 
nome were also compared with those of MHV. The 
5'-noncoding sequence of 17 nucleotides (at nucleo¬ 
tide positions 13-29 in Fig. 1B) immediately before the 
initiation codon, including a stretch of consensus se¬ 
quence (TCTAAAC), is identical to the intergenic se¬ 
quence between the M and N genes of MFIV (14, 15). 
This stretch of consensus sequence is conserved 
among most coronaviruses, and among the 5'-end of 
genome, the intergenic regions, and the mRNA leader 
sequences of the same coronavirus. Thus it is the pro¬ 
posed leader RNA binding site for the initiation of 
mRNA transcription {2, 14, 15). The SDAV N gene is 
flanked on its 3'-side by a noncoding region of 301 
nucleotides (at nucleotide positions 1392-1692 in Fig. 
IB) and a poly(A) tail. The 3'-noncoding sequence of 
SDAV is highly homologous (more than 97% sequence 
identity) to MHV [6, 7) and contains the conserved se¬ 
quence, GGAAGAGC, 81 nucleotides from the 3'-end, 
which has been proposed as a recognition signal for 
negative-stranded RNA synthesis (2). 

It has been reported that high frequency RNA recom¬ 
bination occurs among MHV strains during coinfection 
both in vitro {16 ) and in vivo [17). This recombination 
event is thought to be caused by reattachment of 
pausing RNA intermediates during RNA synthesis at 
sites of high homology [2). The conservation of nucleo¬ 
tide sequences between SDAV and MHV in the inter¬ 
genic and 3'-noncoding regions as well as in the N 
gene suggests that these two viruses are at least struc¬ 
turally able to undergo RNA recombination with each 
other. Under experimental conditions, mice are sus¬ 
ceptible to intranasal and contact infection with SDAV 
(18, 19) and rats are susceptible to MHV by intranasal 
inoculation [20), but the cell tropisms of two viruses are 
considerably different. Therefore, RNA recombination 
between SDAV and MHV is unlikely to occur fre¬ 
quently, it may be possible, in natural infections. Parker 
and Masters (6) suggested that the N protein of MHV 
consists of three domains separated by two spacer 
regions A and B, which have less constraint on their 
amino acid sequences, and these spacers tend to vary 
among a limited set of two or three alternatives. Inter¬ 
estingly, SDAV-681 has an A59-like spacer A and a 
JHM-like spacer B (Fig. 2), implying that the N gene of 
SDAV-681 may have been derived from two prototype 


N genes of MHVs. It is possible that SDAV may be a 
rat-adapted virus arisen by recombination between 
two ancestral MHVs possessing different N genes or 
that recombination among an ancestral rat coronavi¬ 
rus and two ancestral MHVs may have given rise to 
SDAV-681. 

The results of the present study suggest a high de¬ 
gree of genetic relatedness between SDAV and MHV 
which is about the same as that among MHV strains. 
Comparison of the remaining genes with SDAV and 
MHV and detailed analysis of SDAV structural proteins 
will provide a clue to understand their evolution and the 
molecular basis of host cell tropism. 
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